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Abstract 

The claim of Meyer, Kent and Clifton (MKC) that finite precision 
measurement nullifies the Kochen-Specker theorem is criticised. It is 
argued that, although MKC have nullified the Kochen-Specker the- 
orem strictly so-called, there are other, related propositions which 
are not nullified. The argument given is an elaboration of some of 
Mermin's critical remarks. Although MKC allow for the fact that the 
observables to be measured cannot be precisely specified, they con- 
tinue to assume that the observables which are actually measured are 
strictly commuting. As Mermin points out, this assumption is unjus- 
tified. Consequently, the analysis of MKC is incomplete. To make it 
complete one needs to investigate the predictions their models make 
regarding approximate joint measurements of non-commuting observ- 
ables. Such an investigation is carried out, using methods previously 
developed in connection with approximate joint measurements of po- 
sition and momentum. It is shown that a form of contextuality then 
re-emerges. 



PACS numbers: 03.65.Bz, 03.67.Hk, 03.67.Lx 
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1. Introduction 

In a recent series of papers Meyer [Q, Kent [|j and Clifton and Kent |^ (to 
whom we will subsequently refer as MKC) claim to have "nullified" the Kochen- 
Specker theorem ||, |, |, 0. They infer that "there is no truly compelling argument 
establishing that non-relativistic quantum mechanics describes classically inexpli- 
cable physics" They suggest that this may have significant consequences for 
quantum information theory and quantum computing. 

The purpose of this paper is to criticize MKC's conclusions. It is true that 
MKC have circumvented the particular kind of non-classicality which features in 
the Kochen-Specker theorem. It is also true that in doing so they have significantly 
deepened our understanding of the conceptual implications of quantum mechanics. 
However, when it comes to the central question, as to whether non-relativistic quan- 
tum mechanics is classically explicable, it appears to us that a closer examination 
of their models leads to a different conclusion. We will argue that, although MKC 
have nullified the Kochen-Specker theorem strictly so-called, there are other, related 
propositions which are not nullified. The argument we will give is a development of 
some of the points made in Mcrmin's critique J|8| (for other critical comments see 
Havlicek et al [§, Cabello Q and Basu et al |(ll|). 

As Mermin points out, MKC's analysis of finite precision measurements is not 
entirely adequate. MKC only consider one source of non-ideality: namely, the non- 
ideality which is due to inaccuracies in the specification of the observables to be 
measured. In every other respect the measurements they considei]^ are perfectly 
ideal (using the word "ideal" in the sense defined in Section ||) . Such measurements 
might be rather better described (following Mermin) as ideal measurements which 
are not precisely specified. Consequently, the analysis of MKC is incomplete. In 
order to make it complete one needs to extend the analysis to the case of mea- 
surements which are not ideal in any respect: not ideal in respect of the target 
observables, which the apparatus is intended to measure; and not ideal in respect 
of any other observables either. The purpose of this paper is to present such an 
extended analysis. In the first part of the paper we give a more comprehensive 
account of approximate quantum mechanical measurements (based on ideas previ- 
ously presented in Appleby |l2| |l3j). In the second part we apply these results to 
the MKC models. 

It is important to distinguish the specific, technical result proved by Kochen 
and Specker, and the essential point of their argument. By the "essential point" 
we mean the proposition that quantum mechanics (whether relativistic or not) is 
inconsistent with classical conceptions of physical reality. 

In the theories of classical physics it was tacitly assumed 

1. To each observable quantity characterising a system there corresponds an 
objective physical quantity, which has a determinate value at every instant. 

2. An ideal, perfectly precise measurement gives, with certainty, a value which 
exactly coincides with the value which the quantity being measured objec- 
tively did possess, immediately before the measurement process was initiated. 

Of course, real laboratory measurements are not perfectly precise; and this fact was 
acknowledged in classical physics, just as it is in quantum physics. Consequently, 
the above propositions ought to be supplemented: 



^In the main part of their argument. The part of their argument which concerns (in their 
terminology) "positive operator measurements" will be discussed below (see Section H). 
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3. A non-ideal, approximate measurement gives, with high probability, a value 
which is close to the value which the quantity being measured objectively did 
possess, immediately before the measurement process was initiated. 

We will refer to these three propositions collectively as the principle of accessible 
objective values, or the AOV principle for short. Of course, if the AOV principle 
is not true, it does not necessarily follow that objective values do not exist. How- 
ever, if the postulated objective values are typically quite different from the values 
obtained by measurement, then it is difficult to see what is achieved by assuming 
them. Consequently, failure of the AOV principle can be taken (though need not 
necessarily be taken) to justify a positivistic view: on the grounds that "a wheel 
that can be turned though nothing else moves with it, is not part of the mecha- 
nism" (as Wittgenstein iQ succinctly put it, in a different context). This was (in 
essence) the perception which motivated the Copenhagen Interpretation. 

The significance of the Kochen-Specker theorem is that it seems to provide a 
rigorous proof that the AOV principle is inconsistent with the predictions of quan- 
tum mechanics. Kochen and Specker show that, if it is possible to make joint, ideal 
measurements of any set of commuting observables then, in a hidden variables the- 
ory, the result of making an ideal measurement of one observable must, in general, 
depend on which other commuting observables are jointly and ideally measured 
with it. This property is not consistent with clause ||of the AOV principle. 

The weakness in Kochen and Specker 's argument was identified by MKC, who 
noted that the observables to be measured cannot be specified with perfect pre- 
cision. Consequently, in an experiment which is intended to measure one set of 
commuting observables, the possibility cannot be excluded that what is actually 
measured is another, slightly different set of commuting observables. MKC use this 
freedom to construct a hidden variables theory which does satisfy clause |] of the 
AOV principle. It should be noted that the theory they construct is not strictly 
equivalent to standard quantum mechanics (because they postulate that an observ- 
able can only be measured if it belongs to a particular, proper subset of the set of 
all self-adjoint operators). Consequently, they have not shown that clause || of the 
AOV principle is consistent with all the predictions of quantum mechanics (i.e., 
they have not refuted the Kochen-Specker theorem). On the other hand, they have 
shown that this clause is consistent with the predictions of quantum mechanics in so 
far as these are empirically verifiable [i.e., they have nullified the Kochen-Specker 
theorem (strictly so-called)]. 

However, just as MKC have noted a significant weakness in the argument of 
Kochen and Specker, so in turn Mermin ||8| has noted a significant weakness in 
theirs. Although MKC allow for the fact that the observables which are actually 
measured may be slightly and uncontrollably different from the target observables, 
which the experiment is intended to measure, they nevertheless follow Kochen and 
Specker in assuming that the observables which are actually measured still are 
strictly commuting. But, as Mermin points out, MKC's own assumptions sug- 
gest that the observables which are actually measured will almost certainly not be 
strictly commuting. Also (and, as it turns out, closely connected with this point) 
one may ask: do the MKC models satisfy clause ^ of the AOV principle? Af- 
ter all, if one accepts MKC's starting point, that perfect precision is practically 
unattainable, then clause || of the AOV principle, relating as it does to approxi- 
mate measurements, must be the one which really matters. Clause H, by contrast, 
relating as it does to ideal measurements — which is to say practically unrealizable 
measurements — must be regarded as being of negligible importance. Yet clause ^ 
is the only one on which their argument bears. 
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In order to appreciate the force of Mermin's point it will be helpful to specialise 
to the standard example of a spin-1 particle, with angular momentum L. Kochen 
and Specker consider joint measurements of the three commuting projectors (er-L)^ 
for r = 1, 2, 3 where is any orthonormal triad in R"^. A schematic arrangement 
for performing such a measurement using three separate analyzers is illustrated 
m Fig. |. MKC correctly observe that, in such an arrangement, it would not 
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Figure 1. Schematic arrangement for jointly measuring the ob- 
servables (e^ • L)^. The system passes through a succession of 
analysers, which measure each observable in turn. 

practically be possible to align the analyzers precisely along the three directions 
e^. In practice one would expect there to be some uncontrollable errors, so that 
what are actually measured are the projections (ej, • L)^, where the triad is 
close, but not exactly coincident with the triad e^. Nevetheless, MKC assume 
that the triad is precisely orthonormal. Yet it seems clear that, given that the 
errors are random and uncontrollable, and given that the analyzers are separate 
instruments, one would typically expect there to be some slight departures from 
strict orthogonality. At the least, there are no evident grounds for assuming the 
contrary. 

MKC assume that the triad must be exactly orthonormal because they follow 
Kochen and Specker in relying on the principle that it is only sets of commuting 
observables which can jointly be measured with perfect accuracy. However, if one 
relaxes the requirement that the measurements be perfectly accurate, then this 
principle is no longer valid. There is now an extensive literature on the subject 
of joint, inexact measurements of non-commuting observables. To date, the topics 
which have received most attention are joint measurements of position and momen- 
tum 13, 1^, 17 , and joint measurements of the components of spin jTs], |9|. 



For recent reviews, and additional references, the reader may consult Busch et 
al [0, and Leonhardt It should be stressed that recent advances in the 

field of quantum optics mean that such measurements can now be realized, in the 
laboratory. 

The purpose of this paper is to investigate the consequences of extending the 
analysis of MKC, so as to include approximate joint measurements of non-commuting 
observable. 

The paper is in two main parts. The first part, comprising Sections |2|-^ is 
concerned with the theory of approximate measurements. The discussion in these 
sections is based on ideas previously presented in Appleby |12, |l^ (also see Ap- 



pleby ||l^, [T9|). In our earlier papers we were concerned with the two special 
cases, of approximate joint measurements of position and momentum [[l^ , p^ , [l^ , 
p^ , and approximate joint measurements of the components of spin |19|. In Sec- 
tions ^ and 1^ we show how the same methods can be used to analyse approximate 
measurements for any system having a finite dimensional state space (the extension 
to the case of a system having an infinite dimensional state space is straightforward. 
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but unnecessary for present purposes). In Section Q we discuss UfRnk's |22j criti- 
cisms of the description of approximate joint measurement processes which (unhke 
the approach taken in this paper) is based on the concept of an unsharp observable. 
In Section || we speciaHse the discussion to the case of approximate joint measure- 
ments of the operators (e'. -L)^ introduced above, where L is the angular momentum 
for a spin-1 system, and where the triad is not assumed to be orthonormal. 

The account in Sections ||-|| is somewhat lengthy. This is because there are some 
subtleties, and serious potential confusions, which make it necessary to give a de- 
tailed discussion of the underlying concepts. Uffink (also see Fleming [^) has 
identified some obscurities in the theory of joint measurements of non-commuting 
observables as it is presented by (for example) Busch et al . It so happens that 
these objections have a direct bearing on the questions addressed in this paper. One 
of the advantages of our approach is that Uffink's objections do not apply to it (the 
other advantage of our approach being that it leads to an improved ||l6| definition 
of measurement accuracy). The importance of this fact will appear in Section 
where we consider the argument of Clifton and Kent |^ which (they claim) "rule[s] 
out falsifications of non-contextual models based on generalized observables, repre- 
sented by POV measures" . 

In Section ^ we apply the concepts and methods developed in Sections |^-|| to 
joint measurements of the target observables (e^ • L)^, under circumstances where 
the triad is not precisely specified, so that the observables (e^ • L)^ which are 
actually measured cannot be assumed to be precisely commuting. We show that, if 
the errors in the alignments of the vectors are statistically independent, then it 
is possible to prove a modified version of the Kochen-Specker theorem (a Kochen- 
Specker theorem for approximate measurements, as it might be called), from which 
it follows that clause ^ of the AOV principle is not satisfied. This argument was 
originally inspired by some of the points made by Mermin (although it appears to 
us that our formulation is considerably sharper than Mermin's: moreover, Mermin 
does not remark on the need to assume that the errors are independent). 

The result proved in Section ^ shows that, if the alignment errors are inde- 
pendent, then the outcome of an approximate measurement must, in general, be 
strongly dependent on the particular manner in which the measurement is carried 
out. In other words, the theory must exhibit a kind of contextuality. However, it 
may be that there are theories of the MKC type for which the errors are not inde- 
pendent. This possibility is discussed in Section 0. We begin by remarking that, 
although it is conceivable that there exist theories of the type proposed by MKC 
for which the errors are not independent, and which do satisfy all three clauses 
of the AOV principle, it would not be straightforward actually to prove that this 
was the case. The distribution of errors is not a feature of the theory which one 
is free simply to postulate. In a complete theory it should be a consequence of 
the detailed dynamics of the interaction between the system, the measuring ap- 
paratus, and the environment. The models proposed by MKC are, as they stand, 
incomplete, since they do not include any dynamical postulate. In order to show 
that they satisfy clause || the AOV principle it would be necessary, first to specify 
the dynamical evolution of the hidden variables which characterise the interacting 
system-l-apparatus-fenvironment composite, and then to work out the distribution 
of errors which this implies. Moreover, one would need to establish that the as- 
sumption of independence fails in just the way that is required for clause ^ of the 
AOV principle to be satisfied; and one would need to show that this is the case 
for every possible system, and every possible set of measurements. Such a program 
would constitute a highly non-trivial theoretical undertaking. We may therefore 
conclude, in the first place, that it remains an open question, whether there exists 
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a hidden variables theory satisfying clause ^ of the AOV principle. Whether or not 
this clause can be nullified, it has not been nullified yet. 

In the second place it is to be observed that such a theory, if it could be con- 
structed, would entail the existence of a delicately adjusted collaboration between 
the ostensibly random fluctuations in the different parts of a composite apparatus. 
In Section |^ we argue that this would itself represent a kind of contextuality: for 
it would mean that the fluctuations in each component of a complex apparatus 
were, in general, intricately and inescapably dependent on the overall experimental 
context in which that component was employed. In other words, a theory of this 
kind would not so much eliminate the phenomenon of contextuality, as shift the 
locus of the contextuality, from the system, onto the fluctuations in the measuring 
apparatus. 

Our overall conclusion consequently is that, no matter how the theoretical pos- 
tulates are adjusted, some kind of contextuality must appear somewhere. 

Finally, in Section || we discuss Clifton and Kent's |^ theorem 2 which is in- 
tended to "rule out falsifications of non-contextual models based on generalized 
observables, represented by POV measures" (also see Kent j2)). Approximate joint 
measurements of non-commuting observables are most conveniently described us- 
ing a POVM, and so it may at first sight seem that Clifton and Kent's theorem 2 
contradicts the result proved in Section || of this paper. In fact, this is not the case, 
as we show in Section ^. The reason is connected with the point made in Section ^: 
namely, that although the concept of an approximate measurement involves the 
concept of a POVM, it does not involve the concept of a new kind of "generalized 
observable" , distinct from the ordinary kind of observable which is represented by 
a self-adjoint operator. We go on to discuss some other difficulties which arise from 
the way in which Clifton and Kent use the concept of a generalized observable. 



2. Approximate Measurements 

The purpose of this section and the one following is to give a general char- 
acterisation of approximate measurement (s) performed on a system having a fi- 
nite dimensional state space. The observables being measured may be commuting 
or non-commuting. Our approach is based on ideas previously presented in Ap- 
pleby [ p^ p^ , in connection with app roximate joint measurements of position and 
momentum (also see Appleby |lj 0). As discussed in Section H, our approach 
differs from the one taken by many other authors in that it makes no use of the 
concept of an unsharp observable. This will prove relevant in Section H, when we 
discuss Clifton and Kent's theorem 2. The basic physical ideas are described 
in this section. The mathematical elaboration in terms of POVM's is described in 
Section ^. 

An approximate measurement is a measurement which is less than perfectly ac- 
curate. It follows, that in order properly to characterise an approximate measure- 
ment it is necessary first to arrive at a satisfactory, quantum mechanical concept 
of measurement accuracy. This is the problem to which we now turn. We begin by 
considering the accuracy of an imperfect measurement of a single observable. We 
then extend the discussion to the case of simultaneous, imperfect measurements of 
a set of several different observables (commuting or non-commuting). 

The ordinary, intuitive concept of accuracy involves a comparison between the 
result of the measurement, and the original value which the quantity being measured 
did take, immediately before the measurement was carried out. In a quantum 
mechanical context this concept becomes problematic. The reason for this is the 
very feature of quantum mechanics which the Kochen-Specker theorem was intended 
to establish: namely, the fact that in quantum mechanics the concept of "the 
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original value of the observable being measured" is not always well-defined. Of 
course, one is free to make it well-defined, by taking a hidden variables approach. 
However, this way of arriving at a concept of quantum mechanical accuracy is not 
satisfactory because, quite apart from the fact that it compels one to favour a 
hidden variables approach over all the many alternatives, it makes the accuracy 
strongly dependent on which particular hidden variables theory is adopted. It is 
arbitrary, in other words. What one wants is a concept of accuracy which is (1) a 
natural generalization of the classical concept and (2) independent of the way in 
which the theory is interpreted (so that it is a feature of quantum mechanics as 
such, and not simply a feature of this or that particular interpretation). In the 
following we will present a solution to this problem. 

Let us start with the standard, elementary textbook example of a measurement 
process. Consider a system, with finite dimensional state space IKgy, and an appa- 
ratus, with finite dimensional state space IKap. Let A be a system observable acting 
on !Ksy, and let d be a pointer observable acting on IKap. Suppose that A and a 
have the same set of eigenvalues {a}, which for simplicity we will assume to be 
non-degenerate. Let |a)gy be the corresponding eigenvectors of the operator A, and 
let |a)g^p be the eigenvectors of d. Let |(/'o)ap be the initial "zeroed" or "ready" state 
of the apparatus, and let J2a^a \^)sy the initial state of the system. We then 
obtain an idealised measurement process by postulating that the unitary evolution 
operator U describing the interaction between system and apparatus is such that 

^((E^'^l«).y)®l'^o)ap) -E^'^(l«)sy®l«)ap) (1) 

^ a ^ a 

What makes this a measurement is the fact that it establishes a correlation between 
the system and pointer observables. What makes it ideal is the fact that the 
correlation is, in a certain sense, perfect. Specifically: 

1. The measurement is retrodictively ideal in the sense that, if the system was 
initially in the eigenstate of A with eigenvalue a, then there is probability 1 
that the recorded value of the pointer observable will also be a. Consequently, 
if the system was prepared in some unknown eigenstate of A, the result of 
the measurement can be used to retrodict, with certainty, which particular 
eigenstate it was. 

2. The measurement is predictively ideal in the sense that, if the pointer observ- 
able is recorded as having the value a immediately after the measurement, 
then one can predict, with probability 1, that a second, immediately subse- 
quent retrodictively ideal measurement of A will give the same value a. 

It is easily seen that these two properties, of retrodictive and predictive ideality, are 
independent. That is, there exist unitary evolution operators U describing processes 
which are retrodictively but not predictively ideal; and operators U describing 
processes which are predictively but not retrodictively ideal. 

Practically speaking perfection is seldom, if ever attainable. Consequently, one 
does not expect a real measurement process to be either retrodictively or predic- 
tively ideal. A more realistic model of a measurement process is obtained if, instead 
of Eq. (|l]) , we take the evolution to be described by 

^((E^- l«)sy) ® l'^0)ap) = E^-(l«)sy ® l«>ap) + E ^aeaM{\b),y ® M)ap) 
^ a 'a a.b,d 

(2) 

where |ea_M| <C 1 for all a, b, d. Of course, this model docs not include all the compli- 
cations which one might expect to find in a real measurement process. A complete 
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account should allow for the existence of other apparatus degrees of freedom, addi- 
tional to a. It should also allow for the interaction with the environment |^^, and 
for the fact that A and a may not have exactly the same spectrum. However, the 
model just indicated has the merit of simplicity, and it will serve to illustrate the 
essential ideas. 

If the coefficients ta,hd are sufficiently small, then the process described by Eq. (||) 
may be regarded as an approximate measurement of A: for, corresponding to the 
properties |^ and || above, we have 

3. The measurement is retrodictively good in the sense that, if the system was 
prepared in some unknown eigenstate of A^ then the result of the measurement 
can be used to retrodict, with a high degree of confidence, which particular 
eigenstate it was. 

4. The measurement is predictively good in the sense that, if the pointer observ- 
able is recorded as having the value a immediately after the measurement, 
then one can predict, with probability close to 1, that a second, immediately 
subsequent retrodictively ideal measurement of A will give the same value a. 

We next show how it is possible to quantify the degree of accuracy of the mea- 
surement. Define 

if = U'^AIJ 
Ckf — l]'^ aU 

Af, df are the final Heisenberg picture observables, defined at the moment the 
measurement interaction is completed. Let Ai = A denote the initial Heisenberg 
picture system observable, defined at the moment the measurement interaction 
begins. Define the retrodictive error operator ej and predictive error operator ef by 

ej = df - Ai (3) 

e[ — ai — A{ (4) 

Let §sy denote the unit sphere C Oisy Following the discussion in Appleby we 
now define the maximal rms error of retrodiction, A^A by 

AciA = ( sup (U ® 0o| ef IV' ® (l^a)) ] (5) 

and the maximal rms error of prediction, Aof^ by 

AefA = ( sup (g) (/)o| e1 h (g> 0o)) ) (6) 

Of these two quantities the predictive error Aof^ is the easier to interpret because 
if (unlike ej) connects Heisenberg picture observables defined at the same instant 
of time. Let \ip) G IHgy be the (normalised) initial system state. Then, reverting to 
the Schrodinger picture, 

from which we see that, the smaller A^fA, the more closely the result of a second, 
immediately subsequent, retrodictively ideal measurement of A may be expected 
to approximate the result of the (non-ideal) measurement under discussion. In 
particular, if AefA = 0, then the measurement is predictively ideal. It is not difficult 
to see that the condition Aof^ = is in fact, not only sufficient, but also necessary 
for the measurement to be predictively ideal. This justifies the interpretation of 
Aof v4 as providing a quantitative indication of the degree of predictive accuracy. 



8 



Let us now consider the interpretation of the quantity AdA. Suppose, to begin 
with, that the initial system state is an eigenstate of A with eigenvalue a. Then 

((V'0(/'o|C/^(a-a)^{7|V'<^0o))' = ((^ 0o| (af - a)^ jV- ^o)) ' < A^A 

from which we see that, the smaller A^A, the more closely the recorded value of the 
pointer observable may be expected to approximate a, and the more accurate the 
measurement is retrodictively. In particular, if A^A = 0, then the measurement is 
retrodictively ideal. It is not difficult to see that the condition A^A = is in fact 
both necessary and sufficient for the measurement to be retrodictively ideal. 

It is also possible to say something about the result of the measurement in the 
case when the initial system state is not an eigenstate of A. Let A and A A 
denote the initial state mean and uncertainty: 

A = (V'l ilV^) 
AA=(^{^\{A-A)^\^)y 

Then the spread of measured values about the initial state mean satisfies the in- 
equality 

i_ 1 
((V«)<^o|(af-^)^|V'«)0o))' < ((V'«)'/'o| (tif-ii)^|V'«)(/'o))' 

1 

+ ((V'^x/'ol {A~Af |V'®0o))' 

< AciA + AA (7) 

We see from this that there are two components to the spread of measured values. 
AA represents the intrinsic uncertainty of the initial system state. A^A represents 
an upper bound on the extrinsic uncertainty, attributable to the noise introduced 
by the measuring procedure. 

These considerations justify the interpretation of A^A as providing a quantita- 
tive indication of the degree of retrodictive accuracy. 

Finally we note that the necessary and sufficient condition for the coefficients 
ea.bd in Eq. (H) all to be zero (so that the measurement is completely ideal) is that 

AeiA = Aef A = 0. 

Let us now consider a joint measurement of several different observables. If the 
observables are mutually commuting then the above discussion generalises in the 
obvious way. However, the point which is important for the argument of this paper 
is that it also generalises, in a manner which is only slightly less obvious, to the 
case when the observables are not mutually commuting. It is true that one cannot 
make completely ideal joint measurements of a set of non-commuting observables. 
However, there is nothing to preclude one from making joint measurements which 
are only approximate. 

Let Ai, . . . , An be the non-commuting observables to be measured, acting on the 
system state space Oisy Corresponding to these observables we introduce a set of 
n pointer observables di, . . . , d„ acting on the apparatus state space Jfap. We take 
it that the observables ai, . . . , d„, unlike the observables Ai, . . . , An, are mutually 
commuting. Consequently, their joint eigenvectors constitute an orthonormal basis 
for JCap. Let |ai, . . . , an)^^ denote the joint eigenvector with eigenvalues ai, . . . , 
(for the sake of simplicity we assume that the eigenstates are non-degenerate). 
As before, let {(po) be the initial apparatus "zeroed" or "ready" state, and let 
U: J{sy (Xi JCap ^Kgy (E" JCap be the unitary evolution operator describing the 
measurement interaction. 
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The fact that the observables Ai, . . . , An are non-commuting means that we 
cannot choose a basis for Kgy which consists of their joint eigenvectors. However, 
we can choose, for each r separately, a basis which consists of eigenvectors just of 
A^. Let \a,x)^ be such a basis (where a denotes the eigenvalue, and the additional 
index x is to allow for possible degeneracies). We may then write 

J7n^ca.|a,a;),)(8|0o)j - c--faiby-,d,...d„{\b,y)r'»\du---,dn),p) 

a, a; b,y,di,...,d„ 



(r) 

for suitable coefficients fl^J.fjy.j^^ ^ ■ Suppose that, for all r, these coefficients have 

(r) 

the property that fg^J.f^y.ii ^ is small except when a = b = d^. Then, comparing 
this equation with Eq. (|^), we see that, for each r, the pointer provides an 
approximate measurement of the system observable A^. This situation may ap- 
propriately be described by saying that the process provides an approximate joint 
measurement of the set of observables Ai, . . . , An ■ 

As in the case of approximate measurements of a single observable, we may 
obtain a quantitative indication of the accuracy by making use of the Heisenberg 
picture observables Art = U^ArU, ctrf = U^ctrU, A^ = Ar- By analogy with 
Eqs. (|) and (§ define 

Cri — O^ri Afi 
iri = ttrf — Art 

We then obtain n maximal rms errors of retrodiction 

AeiA^ = (^sup ((V^ (g) (/)o| ili |V' -^o))^ (8) 
and n maximal rms errors of prediction 

Act A = ( sup ((Vj ® (jJol IV' ® (l^o)) ] (9) 

where Ssy denotes the unit sphere in the system state space Mgy, as before. 

Concerning the interpretation of the quantities AdAr, Aof^r the same analysis 
applies to them as was given for the errors characterising an approximate mea- 
surement of a single observable, in the paragraphs following Eqs. (||) and (^). In 
particular, we have, by analogy with Inequality 

(^{lp (g) 0o| (ttrf - Ar)^ \^ (g) (/)o)) ' < A^Ar + AAr 

for r = 1, . . . ,n, where Ar denotes the initial state mean, and AAr denotes the 
initial state uncertainty, as in Inequality (Q). 

Even though the Ar are non-commuting, it may still happen that there exist 
states for which the intrinsic uncertainties AAr are all small. If the retrodictive 
errors Aei^r are also small, then the above inequalities show that there is a high 
probability that, for each r, the recorded value of dr will be close to the initial 
state expectation value Ar — which provides a further illustration of the sense in 
which the processes under discussion may be regarded as approximate joint mea- 
surements. For examples of measurement processes to which these comments apply, 
see Appleby 0, |l3|, |l^, |l^, and Section | below. 
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3. Approximate Measurements: POVM 

An approximate measurement is most conveniently analysed in terms of the cor- 
responding POVM (positive operator valued measure). We avoided introducing 
this concept at the outset because we wished to establish that one can give an ade- 
quate theoretical description of approximate measurements whilst remaining wholly 
within the framework of the conventional theory, as it was presented by Dirac 
and von Neumann | |2^ . In particular, we wished to establish that one can introduce 
the concept of an approximate measurement, without being thereby compelled to 
introduce any unconventional, unsharp or generalized observables. However, it is 
certainly true the concept of a POVM represents a powerful mathematical tool. 
Consequently, having established that it is not anything more than a tool (at least 
in the present context), it is appropriate to indicate how the maximal rms errors 
defined in Section ^ can be expressed in terms of this construct. 

As in the last section, we consider a measurement of n non-commuting observ- 
ables Ai, . . . ,An acting on the system state space 3isy ■ The system is coupled to n 
commuting pointer observables di, . . . , d„ acting on the apparatus state space IKap. 
Let |ai, . . . , a„) be the joint eigenvector of di, with eigenvalues ai, . . . ,an 

(which, for simplicity, we assume to be non-degenerate). Let |0o) be the initial 
apparatus state, and let U be the unitary evolution operator describing the mea- 
surement interaction. Let |m) be any orthonormal basis for the system space IKsy. 
Define, for each n-tuplet ai, . . . , o„, 

fa„...,a„^ Y,{H'»{ai,---,an\)U{\m')^\(j)o))\m){m'\ (10) 

Unlike U, which acts on the product space "Kgy 'K^p, the operators Tai,...,a„ act 
just on the system space !Ksy. 
Let 

It is easily verified that Eai,...,ar, is the POVM describing the measurement outcome. 
In other words, the probability that, immediately after the measurement, the n 
pointers will be recorded as having the values ai, . . . , a„ is 

Pai,...,a„ = (V'l ^ai,...,a„ jV') 

where jV') is the initial state of the system, immediately before the measurement. 
It is also convenient to define 

ai , . . .,0^.— 1 ,a^^i ,...,an 

where the summation is over every index except for a^. Ea,: is the POVM describing 
the outcome of the measurement just of Ar, which is obtained by ignoring the other 
n — 1 pointer readings. Thus, the probability that the r^^ pointer reading will be 
ttr is given by 



Pa 



(r) _ 



Starting from the definition of Eq. (g) it is not difficult to show that the r^^ retro- 
dictive error Aoi^r is given by 

AciA = ( sup ((V'l V(ir - ar)Ei[HAr - a^) 
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or, equivalently, 

1 

AeiA = (||^(ir. - ar)Eil\Ar - aO||] ^ (12) 

where || • || denotes the operator norm. Similarly, the r*^ predictive error may be 
expressed 

1 

AefA=(|| E ^'a\,...,a„(i.-«.rt,...,a„||)' (13) 



ai ,...,a 



4. Approximate Measurements and "Unsharp Observables" 

As we stressed earlier, the approach described in Sections |^ and ^ differs from 
the approach of many other authors in that we make no use of the concept of an 
"unsharp observable" , or of what Clifton and Kent |^ refer to as a "generalized 
observable" . In this section we discuss Uffink's criticisms (also see Fleming ||2^ ) 
of this way of describing joint measurements of non-commuting observables. The 
discussion will prove relevant in Section ^, where we consider Clifton and Kent's |^ 
argument "to rule out falsifications of non-contextual models based on generalized 
observables, represented by POV measures" . 

Historically, work on the application of POVM's to the theory of measurement 
has been strongly influenced by the fact that, from a mathematical point of view, 
the concept of a POVM (positive operator valued measure) is a generalization of the 
concept of a PVM (projection valued measure). There consequently arose the idea 
that, since observables of the ordinary, orthodox kind are represented by PVMs, 
therefore a POVM which is not also a PVM must represent an observable of a 
different, unorthodox kind. 

If one takes such a view, then one has to suppose that what would naturally 
be regarded as an approximate measurement of (for example) position, is in fact a 
(non-approximate?) measurement of something else — unorthodox, or generalized, 
or unsharp position as it might be called. This way of thinking is certainly at 
variance with our ordinary intuitions. It would, for instance, not normally be 
argued that a ruler cannot be used to measure length properly so-called, but only 
generalized length. However, this objection is perhaps not crucial, for one does not 
expect quantum mechanical concepts necessarily to accord with classical intuition. 
Nevertheless, there are some pertinent questions regarding the interpretation of 
generalized observables which need to be answered if the concept is to be acceptable. 
As Uffink puts it: "one would naturally like to know what is being measured in a 
measurement of an unorthodox observable" (his emphasis). 

The difficulty becomes particularly acute when it is approximate joint measure- 
ments of non-commuting observables which are in question. Proponents of the 
concept of an unsharp observable argue that, although (for example) the orthodox 
position and momentum observables cannot jointly be measured, there exists a dif- 
ferent pair of unorthodox, "unsharp" observables which are jointly measurable. As 
Uffink points out, the problem with this approach is that, rather than solving the 
original problem (the problem of making a joint measurement of a pair of orthodox 
observables) , it merely presents us with a solution to a new, ostensibly quite differ- 
ent problem (the problem of making a joint measurement of a pair of unorthodox 
observables). Advocates of the approach attempt to deal with this problem by ar- 
guing that the unorthodox observables which one actually measures are related to 
(are unsharp versions of) the orthodox observables which one would like to measure 
in such a way that, by making a (non-approximate?) measurement of the former. 
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one acquires approximate information regarding the latter. However, Uffink has 
identified some problems with this idea |2^, . 

It appears to us that the source of the difficulty lies in the concept of an unsharp 
observable which, at least so far as approximate measurements are concerned, adds 
a wholly unnecessary level of complication to the problem. In classical physics there 
is no need to introduce the concept of an "unsharp" quantity, and then attempt 
to show that, by measuring that, one gains approximate information about the 
ordinary quantity in which one is really interested. It turns out that there is no need 
to introduce such intermediate quantities in quantum physics either — as we showed 
in Sections || and || where (using ideas previously presented in Appleby ||l^, 
p^ , p^ , p9[ ) we described approximate quantum mechanical measurements directly, 
without any unnecessary detours, as measurements of the self-same (orthodox) 
observables concerning which approximate information is sought. 

As was shown in Section ^, the concept of a POVM plays an important role in 
our analysis. However, its role is simply that of a powerful mathematical construct, 
which can be used to describe the outcome of an approximate measurement. It is 
not taken to represent a new kind of observable, distinct from the observable one 
is trying approximately to measure. 

It should be stressed that the above discussion only applies to approximate mea- 
surements of (orthodox) observables. In other contexts we would agree that the or- 
thodox identification of "observable" with "self-adjoint operator" is too restrictive — 
as appears from the fact that, if this identification is correct, then phase and time 
are not observables (see, for example, Busch et al Pegg and Barnett [ p8| , 

Buzek et al |^, Oppenheim et al Q, Egusquiza and Muga and references 
cited therein). It is also clearly true that the concept of a POVM plays an impor- 
tant role in the problem of arriving at a suitably extended concept of a physical 
observable. We only wish to point out that the question is not straightforward, 
and that a simple identification of the concept of a POVM with the concept of a 
generalized observable may be productive of confusion. Some of the pitfalls appear 
from the discussion in Uffink's paper. Others will appear from the discussion in 
Section 0. 



5. Approximate Joint Measurements of the Projections (e,, • L)^ 

We now specialise the theory presented in Sections ^ and ^ to the case which will 
be discussed in the next two sections, of an approximate joint measurement of the 
projections (ei • L)^, (e2 • L)^, (63 • L)^ where L is the angular momentum operator 
for a spin 1 system, and where the unit vectors ei, 62, 63 are approximately, but 
perhaps not exactly orthogonal. 

Let us start by considering an exact measurement of the single projection P = 
(n • L)^, for an arbitrary unit vector n. The system state space "Ksy is thus 3- 
dimensional. To measure P we couple the system to a single pointer observable a 
which has the two (non-degenerate) eigenvalues and 1. The apparatus state space 
IKap is thus 2-dimensional. Let |0), |1) be the eigenvectors of a with eigenvalues 
and 1 respectively. Let & : !Kap IKap be the operator defined by 

a\Q) = -i\l) '^|1>=^|0) (14) 

Let U : yisy ® JCap 3^sy ® Map be the unitary operator defined by 
U — exp 



^^Pa 
. 2 



(1 - P) + iPa 
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Let the initial apparatus state be {(po) = |0). Then 

^' ' ' ifP|^) = |^) 

from which we see that U describes a completely ideal measurement of P. 

In order to obtain a joint measurement of the three projections Pi = (ei • L)^, 
P2 = (e2 • L)2, P3 = (63 • L)2 we can chain together three ideal measurements of 
the kind just described, so that Pi is measured first, P2 second and P3 third, as 
illustrated in Figure ^ We then have three commuting pointer observables di, a2, 
ds acting on the 6-dimensional space IHap. Let |ai, a2, as) be the joint eigenvector 
of di, q;2, ds with eigenvalues ai,a2,a3- The unitary operator describing the 
measurement interaction is 

U = U3U2U1 = ((1 - P3) + iPsCTs) ((1 - A) + iP2^2) ((1 - A) + iPiCTi) (15) 
where the operators ct^ are defined by 

(Ti |Q;i,a2,a3) = (-1)"'? |di,a2,a3) 
(T2 |ai, 012,03) = (-l)"=i 101,02,03) 

(T3 101,02,03) = (-1)"^* |0i,02,d3) 

and where we have employed the notation 0=1,1 = 0. Referring to Eq. (|l^) we 
see that 

Taia^aa = (^"^1 ® («! , "2 , ^3 I ) ( |"l') ® |0,0,0)) |m) (m'| 

m,m' 

where \m) is any orthonormal basis for IKsy ■ Defining P^"'' = 1 — A-, A-^'' = A this 
becomes 

rp _ piaa) p{a2) r>{ai) 

^aia2a3 — ^3 ^2 ^1 

Using Eq. (11), and the fact that the P^"^^ are projections, the POVM describing 
the measurement outcome is 

p _ p("i) p("2) 0(03) p(a2) p(ai) 

^010203 — ^1 ^2 ^3 ^2 ^1 

We may assume that the basis in M"^ has been chosen in such a way that 
l\ / sin lbs /sin cos ( 




ei = I U I 62 = I cos ip I ^3=1 smf sm( 

cos 9 

where the angles ip, 6 (but not necessarily (j>) are small. There is no loss of generality 
in assuming that the basis is right handed (since we are free to adjust the signs 
of the Gr). Making this assumption, and working to second order in O^tp, we find 
(after some rather lengthy algebra) 
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where {L2,L3} denotes the anti-commutator. 

{^010203} is the probabihty that the measurements of Pi, P2, P3 wiU give the 
values ai, a2, as respectively. E ^/j = 9 = then i^aiaaas reduces to the PVM 
(projection valued measure) 

Eiio — I ~ P3 Eioi = 1 — P2 Eon ~ 1 — Pi 

Em = -Eioo = -E-oio = ^001 = £'001 = 

and the probability of the outcome of the measurement being one of the "illegal" 
combinations 111, 100, 010, 001, 000 is zero. If, however, 6, ip are not both = 0, 
then the probability of obtaining one of these combinations, though small, is not 
exactly zero — as was to be expected. 

Using Eq. (^2|) we obtain (after some algebra) the following expressions for the 
retrodictive errors, to lowest order in ip, 9: 

AeiPl « (16) 

AciPa « IV'I (17) 
AeiPa ~ \0\ (18) 

For the sake of completeness we also give the formulae for the predictive errors, to 
lowest order in ip, 9: 

AcfPi « (2(V/ + 9"^ cos^ 0)) ^ (19) 

AefPa « |\/20sin(^| (20) 

AefPa ~ (21) 

It is not difficult to see that the equalities AeiPi = AcfPs = are actually ex- 
act. This is because the joint measurement is constructed by stringing together a 
sequence of measurements which are individually ideal. The errors are entirely at- 
tributable to the disturbance of the system caused by the successive measurements. 
The measurement of Pi comes first, there has been no preceding measurement to al- 
ter the state of the system, and so it is retrodictively ideal: which is why AciPi ~ 0. 
The measurement of P3 comes last, there is no subsequent measurement to alter 
the state of the system, and so it is predictively ideal: which is why AofPs — 0. 

We have assumed that the measurements of the three operators Pr are per- 
formed sequentially, one after the other, because that is the easiest case to analyse. 
However, it is perfectly possible to apply these methods to cases where the three 
measurements are performed all at once (so to speak). For instance, one might 
consider the evolution described by the Hamiltonian 

Hsy and H^p are the Hamiltonians describing the free evolution of the system and 
apparatus respectively. i?moas is the time-dependent Hamiltonian describing the 
measurement interaction, given by 

where / is a "bump" function, which is zero outside the short time interval [0, r], 
and which satisfies the normalisation condition dtf(t) = 1. If r is sufficiently 
small then the unitary operator describing the evolution between t — and t — t 
is approximately given by 



U' « exp 



■ 7f 
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If the vectors e,. are exactly orthonormal, then U' comcides with the operator U 
given by Eq. (^5|). Otherwise it does not. However, it is easily seen that it still 
describes an approximate joint measurement of the operators Pr- 

Every measurement occupies a finite time interval. There is no difference in 
principle between joint measurements which are performed sequentially, so that 
the measurement of each individual observable is allotted its own individual time- 
slice; and joint measurements which are performed contemporaneously, so that the 
measurement of each individual observable takes up the whole of the time which is 
allotted to all. 

6. A Modified Kochen-Specker Argument 

We now apply the concepts and methods developed in the last four sections to 
the questions posed in the Introduction. We consider a spin 1 system, with angular 
momentum L; and we consider the problem of making a joint measurement of the 
observables (e^ • L)^, for some triad e^. 

The argument in this section was originally inspired by some of the points made 
by Mermin However, it appears to us that our formulation is considerably 
sharper than Mermin's. Moreover, Mermin does not remark on the need to assume 
that the alignment errors are statistically independent (see below). 

Before proceeding further, it will be helpful to introduce some terminology. Sup- 
pose that an analyzer is designed to measure the observable (n • L)^ but, due to 
the imprecision in the alignment of the analyzer, does in fact measure the slightly 
different observable (n' • L)^, where n and n' are both unit vectors. Then we will 
refer to (n • L)^ as the target observable, and to the unit vector n as the target 
alignment; while (n' • L)^ will be referred to as the actual observable, and n' as the 
actual alignment. 

Kochen and Specker make no allowance for the imprecision in any real measure- 
ment procedure. They consequently assume that the measurements they consider 
are all ideal (in the sense explained in Section |^), and they assume that the ac- 
tual observables exactly coincide with the target observables. They further assume 
that, in a measurement of the three observables (e^ • L)^, the vectors can be any 
orthonormal triad contained in the real unit 2-sphere, 5*2. 

MKC, by contrast, recognise that the imprecision of any real experimental proce- 
dure means that the actual alignments may be slightly different from the target 
alignments e^. This permits them to make the crucial postulate, that the actual 
alignments are constrained to lie in a proper, dense subset 5*2 C 5*2. However, 
MKC continue to assume that the triad is precisely orthonormal, and that the 
measurements of the observables (ej, • L)^ are all ideal. As we discussed in the 
Introduction, these assumptions are unduly restrictive. The argument of MKC has 
little force unless it can be extended to the case when the triad is not precisely 
orthonormal and when, in consequence, the measurements of the (non-commuting) 
observables (ej, • L)^ are not ideal. This is the problem we now address. 

The argument which follows does not depend on any assumption regarding the 
specific manner in which the measurements of the observables (ej, • L)^ are per- 
formed. The measurements could be performed sequentially, by three separate 
analyzers, as illustrated in Fig. |l|. However, the argument applies equally well to 
the case when the measurements are performed contemporaneously, by a single 
piece of apparatus, as discussed at the end of Section ||. 

In order to proceed it is necessary to make a definite hypothesis as to the distri- 
bution of actual alignments corresponding to a given target alignment. The most 
straightforward hypothesis, and the assumption on which the argument of this sec- 
tion will be based, is that the actual alignments are distributed randomly. Wc will 
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further assume that, in the case of an apparatus which is designed to measure several 
different target observables, the actual observables are distributed independently. 
In other words, we assume that, for each possible target alignment n, there is a 
probability measure fin defined on the set ^2 (i.e., the set to which MKC postulate 
that the actual alignment must belong) such that, in a measurement of the target 
observables (ei • L)^, (e2 • L)^, (ea • L)^, the probability that the actual alignments 
e[, e'2, eg lie in the set Ai x A2 x A3 C S2 ^ S'2 ^ S'2 is ^61(^1)^^62(^2)^63(^3)- 

The probability measure fin depends, not only on the vector n, but also on the 
construction of the apparatus. An apparatus which was constructed differently, 
so as to permit the alignments to be fixed more precisely, would have a different 
associated probability distribution. 

The argument of this section is crucially dependent on the assumption that the 
alignment errors are statistically independent. We will consider the hypothesis that 
the distributions are not independent in Section 0. 

We will denote the set of possible target alignments 82- The question arises: 
what, exactly, is this set? What conditions must a vector satisfy in order to be a 
possible target alignment? One could argue that every vector G S'2 is a possible tar- 
get alignment. However, it might be thought that this view would be too extreme. 
The target observable is the observable which the analyzer is designed to measure; 
and it may reasonably be argued that it is possible to design an instrument to mea- 
sure some observable if and only if it is possible unambiguously to describe that 
observable. We will accordingly take the view that S'2 , the set of possible target 
alignments, consists of those unit vectors g S2 which are finitely specifiable — that 
is, which can be specified in standard mathematical notation, by means of a string 
consisting of finitely many characters. 

The set S'2 so defined is countable, like the set 82- However, unlike the set 
S2, the set S'2 includes Kochen-Specker (KS) uncolourable sets. For instance, it 
includes the uncolourable set given by Kochen and Specker [Q themselves, and the 
one given by Peres 0, ^ (the vectors belonging to these sets manifestly are finitely 
specifiable, since the authors explicitly do so specify them). It follows that Sj is 
itself KS uncolourable. 

The fact that S'2' is KS uncolourable is crucial. It opens the way to a modified 
version of the KS theorem. 

Now consider a valuation /: S2 ^ {0, 1}. The fact that S2 is countable means 
that / is automatically /in-measurable, for every n e S2 . Consequently, we may 
define for each n € S'2 , 



p{n) is the probability that, if the analyzer is designed to measure the target ob- 
servable Pn, then the vector n' S Sj characterising the actual alignment of the 
analyzer will have /-value = I. 

Using the function p{n) we next define an induced valuation /: S2 ^ {0, 1} by 



The valuation / is defined on the set S'2, which is KS colourable. However, the 
induced valuation / is defined on S2 which, as we have seen, is KS uncolourable. It 
follows that there must exist an orthonormal triad ei, 62, eg g S'2 which /-evaluates 
to one of the "illegal" combinations 111, 100, 010, 001, 000. 

Let ei, 62, eg G S'2 be such a triad, and suppose that the sequence of three ana- 
lyzers illustrated in Fig. |^is used to make a joint measurement of the corresponding 
target observables, (ei • L)^, (e2 • L)^, (eg • L)^. Let e[,e2,e'^ € S'2 represent the 



p{n)^fin{W eS^:f{n') = l}) 




if p(n) < 0.5 
if p(n) > 0.5 



17 



actual alignments of the analyzers. We will assume that the precision with which 
the analyzers can be aligned is very high, so that the triad e'j^, Gj, e'^ is very nearly 
orthonormal. 

It follows from the definitions of /, / that, for each r, there is probability > 
0.5 that /(e^) = /(e,). Consequently, there is a non-negligible probability that 
e[,e'2,e'^ /-evaluates to one of the "illegal" combinations 111, 100, DIG, GDI, 000 
(in fact it is straightforward, though somewhat tedious to show that the probability 
of obtaining one of these combinations is > G.5). On the other hand, the fact that 
the triad e[,e2,e'.^ is almost orthonormal, and the results proved in Section 
together imply that the probability that the result of the measurement will be one 
of these combinations is « G. It follows that there is non-negligible probability (in 
fact, probability > G.5) that, for at least one of the observables {e[ ■ L)^, (ej • L)^, 
(gj • L)^, the measured value is not close to the /-value. 

This establishes that, if the alignment errors are random, and statistically inde- 
pendent, then the model must exhibit a form of contextuality: for it means that 
the probable outcome of an approximate measurement must, in general, be strongly 
dependent, not only the observable which is being measured, but also on the par- 
ticular way in which the measurement is carried out. It follows that, if the stated 
assumptions are true, then the model fails to satisfy clause |^ of the AOV principle 
(as stated in the Introduction). 

7. The Assumption of Independence 

It is easily seen that the assumption that the alignment errors are statistically 
independent is crucial to the argument in the last section. For instance, one can 
envisage a model in which the errors are correlated in such a way that the actual 
alignments are always orthogonal. In that case the situation would reduce to the 
one considered by MKC. It should, however, be noted that this assumption, besides 
being somewhat implausible, is empirically falsifiable. We have, until now, been 
following MKC in assuming that the discrepancies between target and actual align- 
ments are a consequence of the limited precision of the measuring device. However, 
one can equally well consider a case where the alignment "errors" are artificially 
controlled, using a random number generator, to be much larger than the minimum 
attainable errors (and consequently measurable). The argument of the last section 
applies to this case just as well as to the case when the errors are due to the finite 
precision of the instrument. Consequently, if one wished to avoid the conclusion to 
that argument in the manner suggested, then one would have to postulate that it 
is physically impossible to set up the apparatus in such a way that the angles be- 
tween the alignments are measurably different from 90° — which (quite apart from 
the implausibility of the suggestion) is a definite empirical prediction. 

Of course, the fact that, in the MKC models, for each of the values and 1, the 
set of vectors which are assigned that value constitute a dense subset of S2 , means 
that these models are very flexible. Consequently, it may be that there exist other, 
rather more subtle, postulates regarding the distribution of actual alignments which 
are not empirically falsifiable, and which do have the property that clause || of the 
AOV principle is then satisfied. However, it would not be very easy to prove that 
this was the case. In a complete hidden variables theory, the probability measure 
describing the distribution of actual alignments is not a feature which one is free 
simply to postulate. It has to be derived, from the dynamics of the interacting 
system-|-apparatus-|-environment composite. The models discussed by MKC are 
incomplete, since they do not include a specification of the dynamics. It is a highly 
non-trivial question, as to whether there exists a dynamics which, in every situation, 
gives rise to a probability distribution having the desired properties — not only in 
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situations where the ahgnment errors arise "naturaUy" , but also in situations where 
the errors are adjusted "by hand" (in the manner described in the last paragraph). 
In the absence of a solution to this problem, the question as to whether there exist 
hidden variables theories satisfying clause ^ of the AOV principle must be regarded 
as open. 

Suppose, however, that a suitable dynamics could be constructed. Regarded from 
the perspective of classical intuition the carefully adjusted correlations between 
the different alignment errors in a complex apparatus which such a model must 
exhibit would seem very peculiar (they would have something of the flavour of a 
"conspiracy"). However, what is perhaps rather more to the point is the fact that 
this phenomenon would itself represent a kind of contextuality: for it would mean 
that the statistical fluctuations in an analyzer were ineluctably dependent on the 
overall experimental context in which the analyzer was used. A theory of this kind 
(supposing that it could be constructed) would not so much nullify the contextuality 
asserted in the conclusion to the Kochen-Specker theorem, as change the locus of 
the contextuality, from the system, onto the alignment errors. 

The conclusion consequently seems to be that, although one may modify its 
precise form, some kind of contextuality must appear somewhere, in any hidden 
variables theory. 

The problem of trying to nullify the non-classical elements in a hidden variables 
theory might be compared with the problem of trying to nullify the rucks in a 
badly fitted carpet. The carpet corresponds to the hidden- variables theory. The 
nails holding the carpet down correspond to the empirical data. One has a certain 
amount of freedom to move the rucks around; and with sufficient ingenuity one may 
succeed in making them less noticeable. However, so long as the nails remain in 
place, the rucks cannot actually be eliminated. 



8. Clifton and Kent's POVM Theorem 

In addition to the part of their argument which we have been considering up to 
now, Clifton and Kent Q also prove a theorem which, according to them, "rule[s] 
out falsifications of non-contextual models based on generalized observables, rep- 
resented by POV measures" (also see Kent |^). As we have seen, an approximate 
joint measurement of non-commuting observables is most conveniently described in 
terms of a POVM. Clifton and Kent's theorem 2 also concerns measurements whose 
outcome is described in terms of a POVM. Consequently, it may at first sight seem 
that their theorem 2 contradicts the result proved in Section ^ of this paper. One 
purpose of this section is to show that this is not in fact the case. The other purpose 
is to point out that, although Clifton and Kent's theorem 2 is valid if regarded as 
a piece of pure mathematics, its physical significance is more questionable. 

Clifton and Kent show, that given any finite dimensional Hilbert space Jf, there 
exists a set Ad of positive operators acting on IK, and a truth function tji : Ad 
{0, 1} such that 

1. If {Ai} C Ad is a finite positive operator resolution of the identity (so that 

= 1), then 

i 

2. the set of finite positive operator resolutions of the identity contained in Ad is 
a countable, dense subset of the set of all finite positive operator resolutions 
of the identity ( "dense" relative to a topology defined in their paper) . 
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They further argue that the set of all such functions tyi is sufficiently large for the 
theory to be able to recover the statistical predictions of any density operator (in 
so far as these are testable using finite precision instruments). 

In considering this claim we note, first of all, that every projection is also a 
positive operator. Consequently, the set of admissible projections which features 
in Clifton and Kent's theorem 1 should be contained in the set of admissible 
positive operators which features in their theorem 2. Also, for any given assignment 
of hidden variables, the truth function : ~* {0, 1} should be the restriction to 

of the truth function tA ■ -Ad — > {0, 1}. It is not entirely clear from Clifton and 
Kent's paper that these requirements can be satisfied. For the sake of argument, 
we will assume that they are satisfied. 

We next address the question, as to what, if any, connection exists between 
Clifton and Kent's theorem 2 and the result which we proved in Section || of this 
paper. It is true that both of these results concern measurements whose outcome is 
described by a POVM (positive operator valued measure) which is not also a PVM 
(projection valued measure). However, there is a significant difference between the 
way in which the POVM is interpreted. As we stressed in Section ^, the measure- 
ments which we consider are approximate measurements of ordinary observables. 
The role of the POVM is simply to provide a convenient mathematical descrip- 
tion of the measurement outcome. By contrast, Clifton and Kent take it that the 
POVM's which feature in their theorem (and which are not also PVM's) represent 
an entirely new species of generalized observable. 

Clifton and Kent do not provide any explicit details, as to how these generalized 
observables are to be measured. However, they appear to assume that the function 
tji-. Ad ^ {0, 1} can be specified independently of the function ty. {0, 1} (as 

we noted above, they do not even explicitly impose the requirement that I'd should 
be contained in ^Id, and that should be the restriction oft a). This suggests that 
they are assuming that, corresponding to the new class of generalized observables, 
there exists a new class of generalized measurements. 

In the approximate measurement procedures which we described in Sections |^, 
H and H the final result is obtained by recording the pointer positions. The pri- 
mary mathematical construct describing the result of the measurement is thus the 
PVM which gives the distribution of pointer positions. The POVM is a secondary 



construct which is defined in terms of this PVM [see Eqs. (10) and (pT[)]. Let 
Eai,...,a„ be the element of the POVM which describes the probability that the 
pointer positions will be ai, . . . , a„. Then the admissibility of -E'ai,...,a„ , as an op- 
erator describing the outcome of a physically possible approximate measurement 
process, entirely depends on the admissibility of the corresponding projection op- 
erator, acting on the apparatus state space. That is, £'ai,...,a„ is admissible if and 
only if the corresponding projection belongs to the set Tj^^, of admissible apparatus 
projections. Moreover, if Eai,...,a„ is admissible, then it should be assigned the 
same truth value as the corresponding apparatus projection. Clifton and Kent, on 
the other hand, because they interpret the POVM as the mathematical represen- 
tation of a completely different kind of observable, assume that they are free to 
fix the set Ad and truth function tji : Ad — > {0, 1} without having any regard for 
the apparatus set and truth function ty': CPd {0,1}- Consequently, their 
theorem 2 has no bearing on the result proved in Section |^ of this paper. 

Until now we have been concerned with the relationship (or lack of relationship) 
between Clifton and Kent's theorem 2 and the result which we proved in Section n. 
However, similar considerations also show that there are certain obscurities regard- 
ing the significance of Clifton and Kent's result, even when it is interpreted in their 
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suggested terms. It is, of course, the case that not every POVM arises in the man- 
ner discussed in this paper, as a way of describing the outcome of an approximate 
measurement of an ordinary observable. In other contexts it may be appropriate to 
think of a POVM as representing a generahzed observable. But, whatever the entity 
is called, one always needs to specify how it is to be measured. The usual answer 
to this question makes use of the Neumark extension theorem Jt], pO| , |3^ , 
In the variant of this approach which was proposed by Peres pT 35 , the system 
of interest is combined with an ancilla, and an ideal measurement is performed on 
the composite. The outcome of this measurement is described by a PVM. It fol- 
lows that, in this case too, once one has fixed the set of admissible projections 
for the system-|- ancilla composite, one is no longer free to omit the corresponding 
system-space positive operators from the set of admissible positive operators. 

It can be seen from this that one has less freedom to choose the set Ad than 
Clifton and Kent assume. The situation regarding the truth values which should 
be assigned to the members of this set is even more problematic. It is not simply 
that these values are already partially determined by the function ty describing 
the pointer observables (in the case of an approximate measurement procedure of 
the kind discussed in Section ||), or the system-|- ancilla composite (in the case of 
Peres' version of the Neumark construction). It is not even clear that the truth 
values are determined unambiguously; for it may happen that a given POVM can 
be physically realized in more than one way. Suppose, for example, that the POVM 
described in Section || is alternatively realized by Peres' version of the Neumark 
construction. In both cases, the outcome is fixed once the relevant functions are 
fixed; but it is far from clear that the outcomes will be the same. In short, it is 
questionable whether it is appropriate to think in terms of there being a well-defined 
truth function on the set Ad- 

9. Conclusion 

In this paper we have made a number of criticisms of the arguments of MKC. 
This should not be allowed to obscure the fact that their work is, in our view, both 
interesting and valuable. MKC have taken a question which seemed clear-cut, and 
shown that it is in fact much more subtle and intricate than had previously been 
appreciated. They have thereby significantly deepened our understanding of the 
conceptual implications of quantum mechanics. 

Similar qualifications apply to our critical remarks (based on Ufhnk's ||2^ crit- 
icisms) concerning the use of the concept of an unsharp observable to describe 
approximate measurements of ordinary observables. As we stressed at the end of 
Section ^, these criticisms should not be taken to imply that we question the need 
for an extended concept of physical observable. 
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